Query-Based Keyphrase Extraction from Long Documents

Abstract

Transformer-based architectures in natural language processing impose input size limits that can be problematic when long documents need to be processed. This paper overcomes this issue for keyphrase extraction by chunking the long documents while keeping a global context as a query defining the topic for which relevant keyphrases should be extracted. The developed system employs a pre-trained BERT model and adapts it to estimate the probability that a given text span forms a keyphrase. We experimented using various context sizes on two popular datasets, Inspec and SemEval, and a large novel dataset. The presented results show that a shorter context with a query outperforms a longer one without a query on long documents.
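
The chunking idea in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `chunk_with_query`, the `[CLS] query [SEP] chunk [SEP]` layout, the `stride` parameter, and the pre-tokenized inputs are all assumptions made for the example. What serves as the query (for instance, a document title) is likewise an assumption here.

```python
from typing import List, Optional


def chunk_with_query(
    query_tokens: List[str],
    doc_tokens: List[str],
    max_len: int = 512,
    stride: Optional[int] = None,
) -> List[List[str]]:
    """Split a long document into model inputs that each carry the query.

    Hypothetical helper: each input is laid out as
    [CLS] query [SEP] chunk [SEP], so three positions go to special tokens.
    """
    budget = max_len - len(query_tokens) - 3  # tokens left for document text
    if budget <= 0:
        raise ValueError("query is too long for the given max_len")
    step = stride or budget  # step == budget means non-overlapping chunks

    inputs: List[List[str]] = []
    for start in range(0, len(doc_tokens), step):
        chunk = doc_tokens[start:start + budget]
        inputs.append(["[CLS]", *query_tokens, "[SEP]", *chunk, "[SEP]"])
        if start + budget >= len(doc_tokens):
            break  # the document end is covered; avoid redundant tail chunks
    return inputs


if __name__ == "__main__":
    query = ["keyphrase", "extraction"]
    document = [f"w{i}" for i in range(10)]
    for model_input in chunk_with_query(query, document, max_len=9):
        print(model_input)
```

Every chunk stays within `max_len` while repeating the same query, so a span classifier applied to each chunk would see the global topic even when the chunk itself is short.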

Similar resources

Query-Oriented Keyphrase Extraction

People often issue informational queries to search engines to find out more about some entities or events. While a Wikipedia-like summary would be an ideal answer to such queries, not all queries have a corresponding Wikipedia entry. In this work we propose to study query-oriented keyphrase extraction, which can be used to assist search results summarization. We propose a general method for key...

PositionRank: An Unsupervised Approach to Keyphrase Extraction from Scholarly Documents

The large and growing amounts of online scholarly data present both challenges and opportunities to enhance knowledge discovery. One such challenge is to automatically extract a small set of keyphrases from a document that can accurately describe the document’s content and can facilitate fast information processing. In this paper, we propose PositionRank, an unsupervised model for keyphrase ext...

Keyphrase extraction through query performance prediction

Previous research shows that keyphrases are useful tools in document retrieval and navigation. While these point to a relation between keyphrases and document retrieval performance, no other work uses this relationship to identify keyphrases of a given document. This work aims to establish a link between the problems of Query Performance Prediction (QPP) and keyphrase extraction. To this end, f...

A Distributed Framework for NLP-Based Keyword and Keyphrase Extraction From Web Pages and Documents

The recent growth of the World Wide Web at increasing rate and speed and the number of online available resources populating Internet represent a massive source of knowledge for various research and business interests. Such knowledge is, for the most part, embedded in the textual content of web pages and documents, which is largely represented as unstructured natural language formats. In order ...

Topical Keyphrase Extraction from Twitter

Summarizing and analyzing Twitter content is an important and challenging task. In this paper, we propose to extract topical keyphrases as one way to summarize Twitter. We propose a context-sensitive topical PageRank method for keyword ranking and a probabilistic scoring function that considers both relevance and interestingness of keyphrases for keyphrase ranking. We evaluate our proposed meth...

Journal

Journal title: Proceedings of the ... International Florida Artificial Intelligence Research Society Conference

Year: 2022

ISSN: 2334-0762, 2334-0754

DOI: https://doi.org/10.32473/flairs.v35i.130737